Initialization of Model Based Coding
نویسندگان
چکیده
Model based coding is a promising very low bit rate video coding technique. The idea behind model-based coding is to parameterize a talking head, and then to extract and transmit the parameters describing the facial movements, such as the facial expressions and the head motion to the receiver. At the receiver, the parameters are used to control face animation in order to reconstruct the talking head. Since only high-level animation parameters are required to drive face animation, very high compression can be achieved with this scheme. In this thesis, we focus on initialization of model based coding. The initialization problem of model based coding is to fit a generic face model onto a target face in the first video frame. Model fitting plays a key role and is the first step toward motion estimation and tracking in both model based coding and model-based tracking. Realizing that a fully automatic fitting is very difficult to achieve and that certain human intervention is inevitable, we propose a new strategy to do model fitting, pseudoautomatic fitting. The traditional initialization process is extended from the application stage only to include offline work. A manual fitting is done offline, where a generic face model is deformed into a personal face model. Another important work performed offline is to define, localize, and organize personal facial features. In the real application stage, only a simplified, automatic fitting operation is needed, detecting and matching the defined features from the first frame of application video. Different methods for implementing the initialization process are discussed in this thesis. The study of combining hardware based rendering with global search methods is presented first, followed by the study of using Dynamic Programming techniques to match facial features. The re-initialization problem is revisited and a new strategy is proposed for dealing with re-initialization.
منابع مشابه
Initialization, Parameters Extraction and Evaluation
This thesis covers topics relevant to model-based coding. Model-based coding is a promising very low bit rate video coding technique. The idea behind model-based coding is to parameterize a talking head and to extract and transmit the parameters describing facial movements. At the receiver, the parameters are used to reconstruct the talking head. Since only high-level animation parameters are t...
متن کاملA new nonlinear feature extraction algorithm for speaker verification
In this paper we propose a new feature extraction algorithm based on nonlinear prediction: the Neural Predictive Coding model which is an extension of the classical LPC one. This model is applied to speaker verification by the Arithmetic-Harmonic Sphericity (AHS) method. Two different initialization methods are proposed for the coding method based on the Neural Predictive Coding (NPC): classica...
متن کاملA new nonlinear speaker parameterization algorithm for speaker identification
In this paper we propose a new coding algorithm based on nonlinear prediction: the Neural Predictive Coding model which is an extension of the classical LPC one. The features performances are estimated by two different methods: the ArithmeticHarmonic Sphericity (AHS) and the Auto-Regressive Vectorial Models (ARVM). Two different methods are proposed for the coding method based on the Neural Pre...
متن کاملInitialization of Model Based Coding in Adaptive Video Transmission:An Analysis-By-Synthesis approach
This paper consider the problem of video content adaptation for the teleconference application in a heterogeneous network communication system. Due to the diversity of content providers and content consumers in the network, the content of the network transportation is often preferred to be adaptive for both efficiency and reliability in the communication. The video content adaptation could be a...
متن کاملJoint 3-D Motion/Disparity Segmentation for Object-Based Stereo Image Sequence Coding
An object-based coding scheme is proposed for the coding of the right channel of a stereoscopic image sequence, using motion and disparity information. A multiresolution block-based motion estimation approach is used for initialization, while disparity estimation is performed using a pixel-based hierarchical dynamic programming algorithm. A split and merge segmentation procedure based on both 3...
متن کامل